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^ ABSTRACT 
>< 

^ • The construction of a catalogue of galaxy groups from the 2-degree Field Galaxy 

■ Redshift Survey (2dFGRS) is described. Groups arc identified by means of a fricnds-of- 

friends percolation algorithm which has been thoroughly tested on mock versions of the 
2dFGRS generated from cosmological N-body simulations. The tests suggest that the 
algorithm groups all galaxies that it should be grouping, with an additional 40 per cent 
of interlopers. About 55 per cent of the ~ 190 000 galaxies considered are placed into 
groups containing at least two members of which ~ 29 000 are found. Of these, 7000 
contain at least four galaxies, and these groups have a median redshift of 0.11 and a 
median velocity dispersion of 260 km s^^. This 2dFGRS Percolation-Inferred Galaxy 
Group (2PIGG) catalogue represents the largest available homogeneous sample of 
galaxy groups. It is publicly available on the WWW. 
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1 INTRODUCTION 

Groups of galaxies are useful tracers of large-scale structure. 
They provide sites in which to study the environmental de- 
pendence of galaxy properties, the galactic content of dark 
matter haloes, the small-scale clustering of galaxies, and the 



interaction between galaxies and hot X-ray emitting intra- 
group gas. Thus, it is desirable to have an extensive, ho- 
mogeneous catalogue of groups of galaxies representing the 
various bound systems in the local universe. 

Many studies in the past relied upon the pioneering 
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work of Abell (1958) to provide a set of target galaxy clus- 
ters (see also Abell, Corwin & Olowin 1989, Lumsden et al. 
1992 and Dalton et al. 1997 for similar studies). However, 
because of the lack of redshift information available when 
Abell was defining his cluster catalogue, concerns have been 
raised over the completeness of his sample and the impact 
that line-of-sight projections would have in contaminating 
it (e.g. Lucey 1983; Sutherland 1988; Erenk et al. 1990; van 
Haarlem, Frenk & White 1997). As a result of these worries, 
it became fashionable to select galaxy cluster samples 1)ased 
upon cluster X-ray emission (e.g. Gioia et al. 1990; Romer 
1995; Ebeling et al. 1996; Bohringer et al. 2001), this method 
being less prone to projection effects. This strategy never- 
theless brings its own complications, because X-ray emission 
depends sensitively on the details of intracluster gas physics. 

The construction of galaxy redshift surveys over the 
past twenty years has enabled a number of groups to pur- 
sue the optical route to group-finding. Huchra & Geller 
(1982), Geller & Huchra (1983), Nohhenius & White (1987), 
Ramella, Geller & Huchra (1989) and Moore, Frenk & White 
(1993) used subsets of the Center for Astrophysics (CfA) 
redshift survey, containing a few thousand galaxies, to in- 
vestigate both the abundance and the internal properties of 
samples of a few hundred galaxy groups. Maia, da Costa 
& Latham (1989) extended the database using the Southern 
Sky Redshift Survey of a further ~ 1500 galaxies and identi- 
fied a sample of 87 groups containing at least two members. 
All-sky galaxy samples of ~ 2400, 4000 and 6000 galax- 
ies were used for group-finding by Tully (1987), Gourgoul- 
hon, Chamaraux & Fouque (1992) and Garcia (1993) respec- 
tively. Deeper surveys of small patches of sky have also been 
used to find groups by Ramella et al. (1999, ~ 3000 galax- 
ies) and Tucker et al. (2000, ~ 24 000 galaxies). Giuricin et 
al. (2000), Merchan, Maia & Lambas (2000) and RameUa 
et al (2002) have performed analyses on catalogues contain- 
ing up to ~ 20 000 galaxies, but the largest set of galaxy 
groups so far is that provided by Merchan & Zandivarez 
(2002). They used the ~ 60 000 galaxies in the contiguous 
Northern and Southern Galactic Patches (NGP and SGP) in 
the '100k' public data release of the 2dFGRS (Colless et al. 
2001). This work extends that sample to the NGP and SGP 
regions in the complete 2dFGRS*. The entire survey con- 
tains about 220 000 galaxies, ~ 190 000 of which are in the 
two contiguous patches once the completeness cuts detailed 
below have been applied. 

As well as using more galaxies than were previously 
available, this study also contains the results from rigorous 
tests of the group-finding algorithm. These are facilitated by 
the construction of very detailed mock 2dFGRSs, created 
using a combination of dark matter simulations and semi- 
analytical galaxy formation models. Comparing the proper- 
ties of the recovered galaxy groups to those of the underly- 
ing parent dark matter haloes provides a robust framework 
within which the parameters of the group-finding algorithm 
can be chosen so as to apportion the galaxies optimally. Fur- 
thermore, the ability to relate, in a quantitative fashion, the 
input mass and galaxy distributions to the set of recovered 



* The 2dFGRS data release is described by Colless et al. (astro- 
ph/0306581) and the data can be found on the WWW at 
http://www.mso.anu.edu.au/2dFGRS/. 



galaxy groups, is of crucial importance when trying to ex- 
tract scientific information from the group catalogue. This 
approach is identical in spirit to that adopted by Diaferio et 
al. (1999) when they studied the northern region of the CfA 
redshift survey. 

The choice of group-finding algorithm is described in 
Section 2. Details of the mock catalogue construction and 
group-finder testing are given in Sections 3 and 4. Section 5 
contains the results when the group-finder is applied to the 
real 2dFGRS. 



2 THE GROUP-FINDER 
2.1 Choice of algorithm 

Given a set of galaxies with angular positions on the sky 
and redshifts, the task of the group-finder is broadly to re- 
turn sets of galaxies that are most likely to represent the 
true bound structures that are being traced by the ob- 
served galaxies. For some applications, not missing any of 
the true group members will be particularly desirable. For 
others, minimising the amount of contamination by nearby, 
yet physically separate, objects will be the priority. It is in- 
evitable that some distinct collapsed objects, situated near 
to each other in real space and along a similar line-of-sight, 
will be spread out by line-of-sight velocities to the extent 
that they overlap with one another. Thus, some unavoid- 
able contamination is to be expected. The aim of the group- 
finder described here is to find a compromise between the 
extremes of finding all true members and minimising con- 
tamination, with a view to providing groups that have veloc- 
ity dispersions and projected sizes similar to those of their 
parent dark matter haloes. Naturally, the efficiency of such 
a group-finder can only be calibrated and tested when the 
properties of the associated dark matter haloes are known. 
The use of realistic mock catalogues is tiius central to the 
entire group-finding procedure because it directly affects the 
composition of the final group catalogue through the choice 
of parameters for the group-finder. 

To date, the job of finding groups in galaxy redshift sur- 
veys has typically been assigned to a percolation algorithm 
(see Tully 1987, Marinoni et al. 2002 and references within 
Trasarti-Battistoni 1998 for alternative approaches) that 
links together all galax;ies within a particular linking vol- 
ume centred on each galaxy. These friends-of-friends (FOF) 
algorithms are specified by the shape and size of the linking 
volume and how it varies throughout the survey. In order 
to produce galaxy groups corresponding to a similar over- 
density throughout the survey, the linking volume should be 
scaled to take into account the varying number density of 
galaxies that are detected &a a function of redshift. Previous 
studies have not all chosen the same scaling for the link- 
ing length with mean observed galaxy number density, n. 
At higher redshifts, fiux-limited catalogues contain a lower 
number density of galaxies, causing the mean intergalaxy 
separation to increase. The algorithm proposed by Huchra 
& Geller (1982) scales both the perpendicular (in the plane 
of the sky) and the parallel (line-of-sight) linking lengths 
and in proportion to n^^^'''. Ramella, Geller & Huchra 
(1989) scaled both linking lengths in proportion to n"^''^. 
In contrast, Nolthenius & White (1987) and Moore, Frenk 
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& White (1993) chose to set to correspond to the typ- 
ical size in redshift space ol groups detected as a function 
of redshift. The perpendicular linking lengths were scaled 
in proportion to n~^^^, this being how the mean projected 
separation varies. Thus, in contrast with the other methods, 
the aspect ratio of this linking volume is not independent of 
redshift. 

In choosing how to scale the linking lengths, there is one 
primary condition that one would like to satisfy. Namely, 
that for a particular group of galaxies sampled at varying 
completeness, the edges of the recovered group should be in 
similar places. If this is achieved, then the inferred velocity 
dispersion and projected size and the actual contamination 
should be independent of the sampling rate. Scaling both 
£± and £\\ by n~^^^ will lead to groups of similar shape and 
overdensity being found throughout the survey. Of course if 
the galaxy distribution is sampled very sparsely then this 
scaling can lead to linking lengths that arc large with re- 
spect to the size of real gravitationally bound structures so, 
depending upon the nature of the galaxy survey, it may be 
desirable to put an upper limit on the size of the linking 
volume. The maximum value of the perpendicular linking 
length is one of the parameters of the algorithm used here. 



2.2 The linking volume 

Having defined how the linking volume scales with mean 
observed galaxy number density, the choice of its shape 
and size still remains. The shape of the linking volume 
should clearly be spherical in the case where real galaxy dis- 
tances are measured. However, with redshift space distances, 
groups will appear elongated along the line-of-sight, and one 
can appreciate that in order to find all of the group mem- 
bers, £|| > £± would be helpful. An approximate estimate of 
the amplitude of this stretching can be made by consider- 
ing how rapidly galaxies will be moving in a halo of mass M 
and radius r. If the circular velocity satisfies = GM/r, the 
line-of-sight velocity dispersion can be written as w 
and the total mass is M = 4/37rr^AcPc, where pc is the 
critical density and Ac defines the mean density relative to 
critical within the halo, then 

(2.1, 

Setting Ac = 150 yields 

- « 600kms"V(/i"^Mpc). (2.2) 
r 

(Note that, according to the spherical 'top-hat' model Ac « 
100, 180 for ilo ~ 0.3, Ao = 0.7 and fio = 1 models respec- 
tively; Eke, Cole & Frenk 1996) Thus, for an object with 
virial radius r, the velocities produce a la stretch along the 
line-of-sight of ~ 6r. The ratio of parallel to perpendicular 
sizes is independent of r. This ignores the redshift measure- 
ment errors which will add in an r-indcpendent parallel con- 
tribution of iTerr ~ 85kms~'^ for the 2dFGRS (CoUess et al. 
2001) with no corresponding perpendicular increase. Even 
at the lowest redshifts this error term is smaller than the 
typical line-of-sight linking length for the 2dFGRS. Conse- 
quently, the shape of the linking volume should be elongated, 
with a particular aspect ratio given by R^ai = i\\/i±: and 



the appropriate value for this ratio should be ~ 12 to enclose 
2(T of the galaxies along the line-of-sight. 

In dark matter N-body simulations, friends-of-friends 
group-finders are often applied in real space with a linking 
length that is 6 = 0.2 times the mean intcrparticle separa- 
tion in order to identify groups having overdensities that are 
~ (Davis et al. 1985). This choice of linking length has 
been shown by Jenkins et al. (2001) to yield a halo mass 
function that is independent of redshift and fto, and thus 
provides a good definition of the underlying dark matter 
haloes. Since redshift space, rather than real space distances 
are available in the 2dFGRS, it is unclear quite what value of 
6gni, in terms of the mean intcrgalaxy separation, is appro- 
priate to reproduce the boundary of the groups as defined 
by a 6 = 0.2 set of dark matter groups. The parameter b will 
be used to set the overall size of the linking volume through 
£j_ = h/n^'^. The semi-analytical model of Cole et al. (2000) 
predicts that light is typically more concentrated than mass 
in groups. Thus, a value of 6gai smaller than 0.2 is likely to 
be appropriate for recovering the b = 0.2 set of dark matter 
groups from a galaxy survey. 

This discussion implies that the linking volume should 
be elongated along the redshift direction, and provides rough 
expectations for both the aspect ratio and overall size of the 
volume that will best recover the underlying dark matter 
haloes. However, the precise shape of this volume is yet to 
be specified. Both a cylinder and an ellipsoid could satisfy 
the above requirements. In the absence of peculiar veloci- 
ties, an ellipsoid reduces smoothly to the usual real space 
spherical linking volume, but tests on mock 2dFGRSs re- 
veal that a cylinder is slightly more effective at recovering 
groups that trace the underlying dark matter haloes. This 
empirical motivation leads to a cylindrical linking volume 
being employed throughout this paper. 

2.3 Empirically motivated fine tuning 

While the picture painted in section 2.2 is pleasingly simple, 
with both the aspect ratio and the linking volume being in- 
dependent of group mass, in practice the optimum linking 
volume to recover the & = 0.2 set of dark matter groups 
from the galaxy survey is not quite this universal. There 
are a number of reasons why this similarity breaks down. 
For instance, the mass to light ratio in the mock catalogues 
varies with halo mass such that light is less concentrated 
in more massive clusters. Also, halo concentration depends 
on mass, and concentration aifects the halo velocity struc- 
ture and total enclosed overdensity (when the halo edge is 
defined by an isodensity contour). Using the galaxy popula- 
tion which sparsely samples the dark matter groups, these 
factors combine to yield a small systematic bias in the re- 
covered group properties. Tests with mock catalogues have 
shown that this produces either small haloes with overesti- 
mated sizes (and velocity dispersions), or large clusters with 
sizes that are underestimated. Across the range of masses 
from 10^^ - lO^'^ Mq this amounts to a 20 per cent ef- 
fect in the projected size and slightly smaller in the velocity 
dispersion. To correct for this requires knowledge of the halo 
mass in which each galaxy resides. An estimate of this quan- 
tity is made by measuring the galaxy density relative to the 
background in a cylinder with aspect ratio J?gai and a co- 
moving projected size of 1.5 Mpc. This density contrast. 
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A, is then used to scale both the size and aspect ratio of the 
linking cylinder according to 



b = b, 



gal 



(Afit) 



and 



H, Ft: 



(2.3) 



(2.4) 



where Afit, £(, and sr are parameters to be fitted from the 
mock catalogues. This enables the removal of the biases de- 
scribed above, while effectively increasing the spread in link- 
ing lengths at a given redshift by a few tens of per cent. 



2.4 The mean galaxy number density 

In addition to varying with redshift, the mean observed 

galaxy number density, n, depends on the depth to which 
a particular region of sky was surveyed, and the efficiency 
with which redshifts were measured. The production of maps 
that describe the angular variation of the survey magnitude 
limit, 6j™(^), the redshift completeness, R{9), and a func- 
tion, fi{9), related to the apparent magnitude dependence of 
the redshift completeness, is described in section 8 of CoUess 
et al. (2001). These quantities, along with a galaxy weight, 
w, that models the local completeness of the 2dFGRS, were 
combined to define n at the position of each galaxy. 

Galaxy weights were calculated by removing all fields 
in the 2dFGRS that have a completeness less than 70% and 
then all sectors (areas defined by the overlap of 2dFGRS 
fields) that have a completeness less than 50%. Rejecting all 
galaxies from fields of low completeness eliminates from the 
sample the small amount of data that was taken in poor ob- 
serving conditions. Rejecting sectors with low completeness 
removes regions which are incomplete due to the fact that 
some 2dFGRS fields were not observed or are excluded by 
the above cut. Unit weight is then assigned to all the galax;- 
ies of the parent APM catalogue in the remaining sectors. 
All of these galax;ies without measured redshifts have their 
weight redistributed equally to their 10 nearest neighbours 
with measured redshifts. Since low completeness sectors are 
excluded, the weights produced are never large. Their mean 
value is 1.2 with an rms dispersion of 0.2. The inverse of the 
weight, 1/w, is a local measure of the completeness in the 
2dFGRS around each galaxy. 

The model for the redshift completeness as a function 
of apparent magnitude is 



Cz(&j,9) = 7[l-exp(6j-At(e))], 



(2.5) 



where 7 is a normalization factor determined by the overall 
completeness in a given direction. The incompleteness in a 
given direction is a result not only of the failure to obtain 
redshifts for some of the faintest galaxies, but also of the 
failure to target galaxies either because the corresponding 
2dFGRS field was not observed or because of constraints 
on the fibre positioning. To determine the completeness as- 
sumed in constructing the mock 2dFGRS catalogues, 7 is set 
in each sector by demanding that the completeness averaged 
over apparent magnitude 

0.(9)= N{b3)c4bj,9)dbj/ N{bj)dbj (2.6) 



be equal to the measured overall completeness, R{9) , in that 
sector. Here, the integrals are over the apparent magnitude 
range of the survey, from a global bright magnitude limit 
to the local faint magnitude limit. A simple power-law fit 
to the observed number counts, N{b,{) cx lO""''''', is used. 
In contrast, to model the completeness in the genuine or 
constructed mock catalogue, 7 is fixed at the position of 
each galaxy by demanding that Cz{9) = 1/w. That is, the 
inverse of the weight assigned to each galaxy is taken as a 
local measure of the completeness in that direction. In the 
case of the genuine survey this hcis the advantage that it 
will automatically take account of any small scale variation 
in the completeness that might occur due to the constraints 
on fibre positioning. Having fixed 7, the comoving number 
density of galaxies at each angular position and redshift is 
computed from the 2dFGRS luminosity function as 



n[z,9)= $(M(6j,«))c.(6j,6')d6j. 



(2.7) 



The luminosity function used here is that estimated by Nor- 
berg et al. (2002), convolved with their model of the 2dFGRS 
magnitude measurement errors. 



2.5 The linking criteria 

Taking all of these survey characteristics into account, and 
defining a maximum perpendicular linking length in physical 
coordinates as I/x,max, the comoving linking lengths associ- 
ated with a particular galaxy are 



b 

ni/3 



(2.8) 



and two galaixies i and j, at comoving distances rfc.i and dc j 
with an angular separation 9ij, are linked together if 



2 V dc.i d, 



and 



\dc,i — dc,j\ < 



(2.9) 



(2.10) 



The conversion of observed redshift to comoving distance 
(Rox) requires an assumption about the cosmological model. 
This impacts both on the inferred galaxy comoving space 
density, as n oc (dV/dz)-'^ oc [RoS{x)]'^^^ / H{zy^^ (see 
e.g. Peacock 1999) and on the comoving distance for a 
given redshift. The angle subtended by the chosen linking 
length scales like 9 oc n~^^^ /{Rox)- Throughout this paper 
no = 0.3 and Ao = 0.7 will be used. This is appropriate for 
the mock catalogues and not obviously very wrong for the 
real one (Spergel et al. 2003). At the median survey redshift 
of 0.11, this model yields an angular linking length that is 
almost two per cent larger (and a comoving distance that is 
about five per cent larger) than an Einstein-de Sitter model. 



3 CONSTRUCTING A MOCK 2dFGRS 

It is clearly important to understand how the galax;y groups 
discovered by this percolation algorithm are related to the 
underlying distribution of dark matter in the universe. To 
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address this issue and, at the same time, determine the opti- 
mum set of parameters to use in the group-finder, mock 2dF- 
GRSs have been constructed using high resolution N-body 
simulations of cosmological volumes and a semi-analytical 
model of galaxy formation. 

The main N-body simulation used is the ACDM GIF 
volume described by Jenkins et al. (1998). The density pa- 
rameter is fio = 0.3, the cosmological constant is Ao = 0.7 
and the normalisation of density fluctuations is set so that 
the present-day linear theory rms amplitude of mass fluc- 
tuations in spheres of radius 8h~^ Mpc, as = 0.9. The box 
size is 141.3 Mpc. Another simulation with 288^ parti- 
cles in a ACDM box of length 154 h'^ Mpc with as = 0.71 
has also been used to test the sensitivity of the optimum 
group-finding parameters to the amplitude of the mass fluc- 
tuations. It turns out that the optimum parameter choice 
is insensitive to this change in erg, although the amount of 
spurious contamination does increase by ~ 10 per cent when 
this decrease in the contrast between groups and not groups 
is applied. The GIF volume will be used in the subsequent 
analysis. 

Dark matter haloes are identified in these simulation 
cubes using a friends-of-friends algorithm with a linking 
length of 6 = 0.2 times the mean interparticle simula- 
tion. The kinetic and potential energies of grouped particles 
are computed and unbound particles are removed from the 
group. Bound groups of 10 or more particles are retained, 
giving a halo mass resolution of 1.4 x 10^^ Mq (see Ben- 
son et al. 2001 for further details). 

The reference semi-analytical galaxy formation model 
of Cole et al. (2000) is used to populate the bound haloes 
identified in the z — output of the N-body sinmlation 
following the prescription outlined in Benson et al. (2000). 
The halo mass resolution of the N-body simulation in turn 
imposes a resolution limit in the semi-analytical calcula- 
tion. This corresponds to the absolute magnitude of central 
galaxies that occupy dark matter haloes that have ten or 
more particles. Scatter in the formation histories of galaxies 
and variable amounts of dust extinction cause a spread in 
the relationship between the luminosity of a central galaxy 
and the mass of the host halo. As a working definition, the 
magnitude limit of a « = volume-limited galaxy cata- 
logue constructed from the N-body simulation is taken to 
be Mb, — Slogj^g/i = —17.5; at this luminosity, 90% of cen- 
tral galaxies predicted in a semi-analytical model calculation 
without any halo mass limitations reside in haloes resolved 
by the simulation. 

The luminosity resolution of the volume-limited cat- 
alogue, when combined with the global k + e correction 
adopted by Norberg et al. (2002), implies that a mock 2dF- 
GRS survey constructed from this simulation output will 
only be complete above a redshift of z = 0.08. The median 
redshift of the 2dFGRS is « = 0.11, so it is desirable to ex- 
tend the mock catalogue to fainter luminosities. This was 
done by constructing a volume limited sample of galaxies 
from a separate semi-analytical calculation for haloes with 
masses less than the N-body resolution limit. These galaxies 
were then assigned at random to the particles in the simu- 
lation that were not part of a bound halo. This should be 
a reasonable approximation because the clustering of dark 
matter haloes is almost independent of mass for masses be- 
neath the resolution limit of the GIF simulation (Jing 1998). 



Using this technique, the luminosity limit was shifted to 
Mfjj — 51ogjQ/i = —16.0, so that a flux limited catalogue 
constructed from this output would be complete above a 
redshift of 2 = 0.04. 

A mock 2dFGRS was constructed from the volume- 
limited galaxy catalogue by applying the following steps: 

0) A monotonic transformation was applied to the magni- 
tudes given by the semi-analytical model, perturbing them 
slightly so as to reproduce the 2dFGRS luminosity func- 
tion. The reason for doing this is that the galaxy luminosity 
function of the semi-analytical model is not a perfect match 
to the measured 2dFGRS luminosity function (see Fig. 1 
of Benson et al. 2000) and it is desirable that the selection 
function of the mock catalogue should accurately match that 
of the genuine survey. The magnitudes are then perturbed 
using the model of the 2dFGRS magnitude measurement er- 
rors described in Norberg et al. (2002). 

1) The volume-limited catalogue was replicated, about a 
randomly located observer, to take into account the much 
greater depth of the 2dFGRS volume compared to the size 
of the N-body simulation box. This has no impact upon the 
tests of the group-finder presented here, although it does 
mean that the mock is unsuitable for studying the cluster- 
ing of groups on scales approaching the size of the simulation 
box. 

ii) Galaxies were then selected from within this volume by 
applying the geometric and apparent magnitude limits of 
the 2dFGRS survey defined by the map, 6j'°(0), of the sur- 
vey magnitude limit (see Section 2 and Colless et al. 2001 
section 8). This produces a mock catalogue in which every 
galaxy has a redshift. 

iii) The appropriate redshift completeness was then imposed 
sector by sector on the mock catalogue, by randomly retain- 
ing galaxies so as to satisfy the function Cz{bj,d) (equa- 
tion 2.5). 

This method ignores any systematic variation of the com- 
pleteness within a sector, and in particular it does not take 
account of closc-jjair incompleteness on angular scales less 
than ~ 0.75 arcmin. To investigate the effect of this, a sec- 
ond set of mocks was also produced, in which close pairs 
were identified in the parent mock catalogue and preferen- 
tially rejected when reproducing the incompleteness in each 
sector. Not all close pairs are missed in the 2dFGRS because 
the large overlaps between 2dFGRS fields permits difiierent 
members of close pairs to be targetted on different fields. 
One statistic that can be used to quantify the level of close- 
pair incompleteness is the ratio of the angular correlation 
function of objects that have measured redshifts to that of 
the full parent catalogue (Hawkins et al. 2003). To repro- 
duce the appropriate level of close pair incompleteness in the 
mock catalogue, the parameters of the rejection algorithm 
(the angular scale of the close pairs and the fraction that 
are rejected) were tuned so as to reproduce this statistic. In 
practice, the recovered groups hardly varied at all with the 
inclusion of the close pair incompleteness. This is because 
of the combination of the large amount of field overlap in 
the 2dFGRS and the small angles over which fibre clashes 
become important. 

Every galaxy in the mock survey is either spawned by 
a dark matter halo containing at least ten particles or is 
a background galaxy that is located at the position of an 
ungrouped dark matter particle. In the following section, the 



© 0000 RAS, MNRAS 000, 000-000 



6 V.R. Eke et al. (the 2dFGRS Team) 



phrase the number of galaxies spawned by a particular dark 
matter halo will be used to refer to the subset of galaxies 
belonging to that halo which make it through the observing 
procedure and into the mock survey. 

The use of a realistic mock 2dFGRS catalogue to set the 
parameters required for the group-finder and to interpret the 
nature of the derived groups represents a clear advance over 
previous work, which either neglected to present any such 
tests or, at best, used mock catalogues constructed from 
dark matter only simulations. One criticism that could be 
levelled at the approach presented here is that it is model 
dependent. However, direct tests have been performed to 
confirm that the optimum group-finding parameters arc in- 
sensitive to catalogues created with a 20 per cent lower value 
of (78 (and appropriate adjustments to the semi-analytical 
galaxy formation model). It should be borne in mind that 
the default model does provide a very reasonable description 
of the real universe. In particular, it is in excellent agroomont 
with a number of observables that have a direct bearing on 
group-finding in the 2dFGRS: the local luminosity function 
in the fej-band (see figure 10 of Madgwick et al. 2002; Cole et 
al. 2000), the clustering of luminous galaxies (Benson et al. 
2000) and the dependence of galaxy clustering on luminosity 
(Benson et al. 2001; Norberg ct al. 2001). Any alternative 
model would also need to reproduce all of these relevant ob- 
servations for the testing of the group-finder to be similarly 
appropriate. 



4 TESTING THE GROUP-FINDER 

The mock surveys, complete with their parent dark matter 
haloes, provide a database on which to optimise the param- 
eters of the group-finder. These are mainly the maximum 
perpendicular linking length (I/±,max), the aspect ratio of 
the linking cylinder (-Rgai) and the number of mean inter- 
galaxy separations defining the perpendicular linking length 
(6gai). In the remainder of this paper, the additional tweaks 
to the linking volume described in Section 2 will be set to 
values obtained empirically: 



Afit = 5; €6 = 0.04; en = 0.16. 



(4.1) 



The process of deciding which is the best set of group- 
finding parameters requires a definition of what is good. A 
good group-finder should find a high fraction of the available 
groups, provide accurate estimates of their size and veloc- 
ity dispersion, avoid splitting them up into subgroups and 
minimise the number of interloping galaxies. In an attempt 
to quantify these qualities, the following four statistics have 
been defined: 

1. Completeness, c, is the fraction of detectable dark matter 
haloes that have more than half of their spawned galaxies 
in a single galaxy group. A dark matter halo is detectable if 
it spawns at least two galaxies into the mock survey. 

2. The median accuracy, or Or, is the median log^g of 
the ratio of associated galaxy group to dark matter group 
velocity dispersion or projected size. The associated galaxy 
group is the one that both contains the largest number of 
the galaxies spawned by this dark matter halo (or the group 
with most members overall if there is a tie), and is asso- 
ciated to this halo. A galaxy group is associated with the 
dark matter halo that spawned most of its members. If this 



is not unique, then the most massive of the possible dark 
matter haloes is chosen. With these definitions, it is possi- 
ble for more than one galaxy group to be associated with the 
same dark matter halo, but that dark matter halo will have 
only one associated galaxy group. The set of values from 
which Oct and Or come is constructed by matching every de- 
tected dark matter halo with its associated galaxy group. A 
detected dark matter halo is one with an associated galaxy 
group. The spread about the median is described by the 
semi-interquartile range, s^ or s,.. 

3. Fragmentation, /, is the mean number of extra galaxy 
groups per dark matter halo having mass, defined later in 
this subsection, at least 0.2 times that of their associated 
dark matter halo for all detected dark matter haloes. 

4. The quality, g, of an individual halo to group match is 
defined as 

Afgood — ATbad 



9 = 



(4.2) 



where A^good is the number of member galaxies spawned by 
this halo that are found in the associated galaxy group, Abad 
is the number of group members not spawned by this halo 
and Aspawn is the total number of galaxies spawned by this 
dark matter halo. 

The velocity dispersions of the galaxy groups were calculated 
using a variant of the gapper estimator described by Beers, 
Flynn & Gebhardt (1990). This is an efficient estimator that 
is resistant to outlying velocities. Tests on groups found from 
mock catalogues showed that this estimator gave an that 
varied less with the minimum number of galaxies per group 
than other choices. The gapper estimator also yielded an s^ 
that was at least as low as the others for groups of all sizes. 
In detail this involves ordering the set of recession velocities 
{vi} of the N group members and defining gaps as 



Qi = Vi+l - Vi, I 

Using the following weights 
Wi = i{N — i), 



(4.3) 



(4.4) 



the gapper estimate of the velocity dispersion can then be 
written as 



N(N 



_ N-l 
TT \ - 

i = l 



(4.5) 



Under the assumption that one of the galaxies is moving at 
the centre-of-mass velocity of the halo, which is certainly 
true for the mock catalogues, the estimat ed velocity disper- 
sion is multiplied by an extra factor of N/{N — 1) before 
the rcdsliift measurement 6rror, cr err 5 is removed in quadra- 
ture, giving 



■ A - 1 



(4.6) 



Velocity dispersions of the parent haloes, which inevitably 
have many more dark matter particles in them than there 
are galaxies in the associated groups, were calculated as the 
root mean squared velocity difference from the mean. 

The projected size of groups and haloes, r, is defined as 
the weighted root mean squared projected separation from 
the group centre of the members. This calculation is per- 
formed taking into account the galaxy weights resulting from 
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Figure 1. The variation of the comoving perpendicular linking 
length with redshift when -Lx,max = 2/i~^ Mpc and 6gai = 0.13 
for each galaxy in a mock SGP. Note that the parameter Lx,max 
is in physical coordinates, whereas t± is in comoving coordinates. 



the variable incompleteness, as described in Section 2.4. The 
centre is defined using an iterative method that first calcu- 
lates the arithmetic weighted mean position of the remain- 
ing galaxies then rejects the most distant galaxy. When only 
two galaxies remain, the position of the galaxy with a larger 
weight, or if these are the same, the larger flux, is deemed to 
represent the group centre. The projected group size is then 
the weighted rms projected physical separation of the other 
N — 1 galaxies from this central galaxy. This measurement, 
in conjunction with the velocity dispersion, can be used to 
estimate the group mass as 

M = A^, (4.7) 

where the value of A was chosen so that, for the optimum 
choice of group-finding parameters described later in this 
section, the median mass was unbiased. This led to a choice 
of A = 5.0. The fragmentation statistic was calculated using 
these galaxy group masses. 

These definitions provide a framework within which 
comparisons can be made between group catalogues re- 
turned by different group-finding parameters. A good set of 
parameter values will yield a set of galaxy groups that have 
a completeness near to 1, a median accuracy of (this being 
the log of the ratio of measured to true velocity dispersion 
or radius), independent of the group mass or redshift, with 
a small spread about this median, a fragmentation near to 
0, and a quality of ~ 1. 



4.1 Definitions 

4.2 Parameter optimisation 

In Section 2, the three main free parameters of the group- 
finder were described. These determine the overall size of 
the linking volume through 6gai, the maximum size of the 
linking volume via L±,max and the aspect ratio of the link- 
ing volume, defined by -Rgai. Applying the group-finder to 
the mock catalogues described in Section 3 using various 
group-finding parameters, the recovered galaxy groups can 
be compared with their parent dark matter haloes, and the 
optimum set of parameters can be found. The following three 
subsections detail the effect of varying each of these param- 
eters individually. 

4.2.1 Varying Lj_^ma:K 

The appropriate value for the maximum perpendicular link- 
ing length, I/x,max, should be similar to the size of the typi- 
cal objects that are detectable at larger redshifts, where the 
number density of galaxies is low and this limit becomes rel- 
evant. Values around a couple of physical Mpc are there- 
fore a good place to search. i?gai = 11 and fegai = 0.13 have 
been fixed. The justification for choosing these particular pa- 
rameter values is contained in the following two subsections. 
Fig. 1 shows the behaviour of the comoving perpendicular 
linking length with redshift for Lx^max = 2/i~^Mpc. The 
increase of £± with redshift reflects the decreasing mean ob- 
served rmmber density of galaxies, and the spread in linking 
length at a given redshift comes from the angular variation 
in survey depth and the fraction of galaxies with measured 
redshifts. Larger values of Lx.max lead to larger linking vol- 
umes at the highest redshifts and some associated additional 
contamination. Decreasing Lx.max yields underestimates of 
the projected sizes and velocity dispersions of the groups 
at redshifts where the limit affects the size of the linking 
volume. Thus Lx.max = 2 Mpc is chosen as a physically 
motivated compromise. 

4.2.2 Varying iigai 

As was estimated in Section 2, the appropriate choice for the 
axis ratio of the linking cylinder is ~ 10. A variety of values 
surrounding this one have been tried in conjunction with 
ix.max = 2/i~'^Mpc and 6gai = 0.13. Fig. 2 compares the 
properties of the dark matter haloes with those of their asso- 
ciated galaxy groups. The projected group sizes are fairly in- 
sensitive to _Rgai as would be expected considering that these 
changes only impact upon the line-of-sight linking length. In 
contrast, the velocity dispersions are affected, and the more 
elongated linking volumes yield larger values, with an unbi- 
ased median being returned when flgai « 11. As the linking 
volume is stretched, the completeness, fragmentation and 
quality change only very slightly, with the most significant 
change being a decrease of the quality of the group matches 
at higher redshift. Note that the drop in quality for the 
most meissive haloes is driven by the large number of high 
redshift, poorly sampled groups. To illustrate this, an addi- 
tional long-dashed line is included in the lowest two panels in 
the first column of Fig. 2 showing the results using only the 
groups at z < 0.15. This demonstrates that considering only 
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Figure 2. The effect of varying -Rgai for an SGP mock catalogue. Lx.max and 6gai are held fixed at 2 h^^ Mpc and 0.13 respectively and 
i?gal is given values of 7 (dotted), 11 (solid) and 15 (dashed). The top row shows the mean comijleteness as a function of dark halo mass, 
redshift and number of galaxies spawned by the dark matter halo. The next rows show the median accuracies of the velocity dispersions 
and projected sizes of the galaxy groups, and the spread around the median value for the R^ax = 11 case (upper solid lines without 
error bars). The 1 — it errors shown on the median i?gai = 11 curves are the errors on the mean accuracy calculated from the spread, 
assuming that the individual accuracies are distributed in a Gaussian fashion. The mean number of additional galaxy groups associated 
with detected dark matter haloes, as parametrised by the fragmentation, is displayed in the next row, followed by the mean quality of 
the group to halo matches in the penultimate row. This provides information about the difference between the numbers of good and bad 
member galaxies. The mean number of bad interlopers relative to the immbcr of galaxies spawned by the parent halo is shown in the 
final row. An extra long-dashed curve is shown in the two lowest panels in the first column to show how the quality of high mass halo 
matches improves when only z < 0.15 groups are considered for the case of Rgai = 11. This also produces a corresponding decrease in 
the fraction of interlopers, as shown in the bottom panel. 
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13 14 15 0.1 0.2 12 3 



Figure 3. The effect of varying 6gai for an SGP mock catalogue. With Lx.max = "2 h^^ Mpc and iJgai = 11, f>gal was given values of 
0.11 (dotted), 0.13 (solid, and long-dashed for the z < 0.15 subset in the two lowest panels in the first column) and 0.15 (dashed). All 
quantities shown are the same as those in Fig. 2. 



the nearby, well sampled massive groups yields a mean halo 
quality, and interloper content, that is comparable with the 
lower mass haloes. Using the requirement that the median 
velocity dispersion be faithfully measured selects i?gai = 11 
as the best choice. This is comparable with that suggested 
by the rough calculation leading to equation 2.2. Note that 
without the additional halo mass dependent tweaks to the 
linking volume discussed in Section 2, the values of which are 
given in Section 4.1, there would be a gradient of ~ —0.05 



in all median accuracy curves as a function of dark matter 
halo mass. 



4.2.3 Varying 6gai 

Keeping Lx.max = 2 Mpc and Rg^i = 11 fixed, the value 
of 6gai, the number of mean intergalaxy separations defining 
the perpendicular linking length, was varied from 0.11 up to 
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Figure 4. The redshift dependence of ratios of the memberships 
of the three sets described in the text. The labels on the y axis 
are ordered from bottom to top like the curves at z = 0.1. Thus, 
the fraction of all galaxies that have been spawned by haloes 
that spawn at least two galaxies (Ni/Nt) is shown with the solid 
line. The long dashed curve shows the fraction of all galaxies that 
are put into groups (Ng/Nt). The short dashed curve traces the 
fraction of grouped galaxies that are in set /. The dotted line 
shows the fraction of the Ni galaxies that are actually put into 
groups {Ngi/Ni), and the dot-dashed curve displays the ratio of 
the number of grouped galaxies to the number in set I. 

0.15, and the parent b = 0.2 dark matter haloes were again 
compared with the resulting galaxy groups. Fig. 3 shows the 
results. As the linking lengths are increased, the recovered 
median group velocity dispersions and projected sizes also 
increase as larger volumes are grouped together. This is in 
contrast to the results when i?gai was varied and only the 
velocity dispersions were much affected. These changes are 
significant for all parent dark matter halo masses. Along 
with these variations, the increasing linking lengths increase 
completeness, reduce fragmentation and decrease the quality 
of the matches. The least biased accuracies are produced 
when bgai = 0.13, and this is the value adopted for the rest 
of this paper. 

4.3 Summary of group-finding parameters 

The results from the testing described in the previous three 
subsections suggest that an appropriate choice of group- 
finding parameters is Lx,max = 2/i~^Mpc, 7?gai = 11 and 
bgai = 0.13. These provide a set of groups that have unbi- 
ased velocity dispersions and sizes. In doing this, they con- 
tain almost all of the galaxies that should be included in 
groups with at least 2 members, and some interlopers as well. 
Smaller values of all of the above three parameters should 
be employed if reducing contamination is of greater impor- 
tance than capturing as many true group members while 
not overestimating the velocity dispersion or projected size. 



The level of contamination in the groups found by this par- 
ticular choice of group-finding parameters is illustrated in 
Fig. 4. This shows the redshift dependence of ratios of the 
memberships of the following three sets: 1) t, representing all 
galaxies in the mock survey, 2) g, representing all grouped 
galaxies and 3) I, representing galEixies that come from dark 
matter haloes which have spawned at least two galaxies into 
the mock survey, i.e. those that could be linked to another 
galaxy spawned by the same parent halo, gl is used to denote 
the overlap between sets g and I, and Ni is the membership 
of group i. 

The dotted curve in Fig. 4 shows that the detected 
groups contain almost all of the galaxies that belong in set 
I for all redshifts. For rcdshifts less than 0.1, the fraction 
of all galaxies that are grouped is ~ 0.60, as shown by the 
long dashed curve, whereas the solid curve shows that the 
fraction of all galaxies in set I is only ~ 0.4. Of the grouped 
galaxies in this redshift range, only about 70 per cent of 
them are members of set I. This quantity is shown by the 
short dashed line in Fig. 4. 

The accuracy statistic relates only to the dark matter 
haloes that have spawned at leeist two galaxies (i.e. the de- 
tectable haloes), and have at least one associated galaxy 
group. For the above choice of group-finding parameters, 
about 51 per cent of galaxy groups are the best matches 
to detectable dark matter haloes, ~ 47 per cent are associ- 
ated with dark matter haloes that have only spawned one 
galaxy, ~ 2 per cent arc other matches with dark matter 
haloes and the remaining ~ 0.2 per cent are made up com- 
pletely from background galaxies that have been spawned by 
no well defined dark matter halo. Thus, a significant frac- 
tion of the recovered groups are not being used to determine 
the group-finding parameters. These groups usually have ve- 
locity dispersions and projected sizes that are larger than 
those of their parent dark matter haloes. Consequently, the 
measured velocity dispersion and projected size for typical 
groups containing 2 galaxies, will be overestimates of those 
of the underlying dark matter halo. However, it would be 
inappropriate to adapt the group-finding parameters to be 
unbiased when these spurious groups are included, because 
this would inevitably spoil the results for the good matches. 



5 APPLICATION TO THE 2dFGRS 

Together, the NGP and SGP regions in the 2dFGRS con- 
tain 191 440 galaxies once cuts of 70% and 50% have been 
applied for field and sector completeness respectively. This 
set of galaxies, with their appropriate weights so that the 
mean observed galaxy number density n can be defined for 
each galaxy according to equation 2.7, has been used along 
with the friends-of-friends group-finder described in Sec- 
tion 2 with L±,max = 2h~^ Mpc, J?gai = 11 and 6gai = 0.13, 
in order to find 'real' galaxy groups. The resulting catalogue 
places 55 per cent of all of the galaxies within 28877 groups 
with at least two members. A total of 7020 groups are found 
with at least four members, and their median redshift and 
velocity dispersion are 0.11 and 260kms~^ respectively. The 
corresponding quantities for the sets of groups with at least 
3 or 5 members are (A'groups, median z, median cr/ km s~^) 
= (12566, 0.11, 227) and (4503, 0.11, 286). 

The spatial distribution of the 2PIGGs containing at 
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Figure 5. The spatial distribution of groups containing at least 4 members in the NGP and SGP regions. Dot colour and size represent 
the group velocity dispersion and unweighted number of members respectively, as shown in the legend. 
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Figure 6. Histograms showing the distribution of group redshifts, velocity dispersions, masses and the weighted number of members 
from the real data. Only groups containing at least 4 members have been included. The solid and dashed histograms show groups found 

in the SGP and NGP respectively. 



least 4 galaxies is illustrated in Fig. 5. Each dot represents 
a group, with the colour and size representing the group ve- 
locity dispersion and weighted galaxy content respectively. 
Typical dot sizes decrease at large redshifts because the flux- 
limited survey means that large distant groups are sampled 
with only a few bright galaxies. AX z > 0.15 the typical 
velocity dispersion also grows. This happens both because 
only the bigger groups have enough bright galaxies to be re- 
covered, and the smaller group memberships produce larger 
errors on the measured velocity dispersions, thus increas- 
ing the abundance of groups with apparently large veloc- 
ity dispersions. Figure 6 shows the distributions of these 
groups with respect to redshift, velocity dispersion, mass 
and weighted number of members, for the NGP and SGP 
strips of the survey. While the SGP has almost 50 per cent 
more galaxies in it than the NGP, the fraction grouped (0.56 



and 0.54 for the NGP and SGP respectively) and the distri- 
butions of properties of the resulting groups axe very similar. 
The main difference is in the redshift distribution of groups, 
where the lower flux limit in the SGP betrays itself with a 
more extended distribution than the NGP. Consequently, a 
few extra high velocity dispersion, or equivalently high mass, 
clusters are found in the south. Although they are not re- 
produced here, these distributions of group properties are 
well matched to those found from the mock catalogues that 
were used to calibrate the group-finder. 

Figures 7 and 8 show how the group memberships and 
velocity dispersions vary with redshift. This ties together 
the information contained in Figures 5 and 6, showing how 
the typical number of galaxies per group decreases with in- 
creasing redshift, while the velocity dispersion increases with 
increasing redshift. 
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Figure 7. Histograms sliowing the distribution of weighted group 
memberships for groups containing at least 4 galaxies. These 
combine the data from NGP and SGP, and include groups in 
the following two redshift ranges: 0.04 < z < 0.08 (solid) and 
0.14 < ^ < 0.18 (dotted). 
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Figure 8. Histograms showing the distribution of velocity dis- 
persions for groups containing at least 4 galaxies. These combine 
the data from NGP and SGP, and include groups in the following 
two redshift ranges: 0.04 <z< 0.08 (soUd) and 0.14 <z< 0.18 
(dotted). 



One interesting comparison can be made with the re- 
sults of Morchan & Zandivaroz (2002) . They used the public 
100k data release version of the 2dFGRS and defined groups 
with a percolation algorithm similar to the one used here. 
For 2PIGGs containing at least four members, the mean 
redshift and velocity dispersion, 0.11 and 260kms~^, are 
very similar to their values of 0.105 and 261kms~^. The 
total number of these groups has increased from the 2209 
that Mcrchan & Zandivarez found to 7020 in the 2PIGG 
catalogue. This factor is similar to the increase in the to- 
tal numbers of galaxies being used, i.e. roughly the same 
fraction of galaxies are being grouped in both cases. 



6 CONCLUSIONS 

A friends-of-friends percolation algorithm has been de- 
scribed, calibrated and tested using mock 2dFGRSs and 
then applied to the real 2dFGRS in order to construct the 
2dFGRS Percolation-Inferred Galaxy Group (2PIGG) cata^ 
logue. 

From the mock catalogues, it is possible to determine 
the typical accuracies with which velocity dispersions and 
projected halo sizes are recovered. For detectable haloes re- 
covered with galaxy groups containing at least 4 members 
the estimates of the velocity dispersion are unbiased in the 
median and the ratio of inferred to true velocity dispersion 
has a semi- interquartile range of ~ 30 per cent. The cor- 
responding number for the spread about the median ratio 
of inferred to true projected size is ~ 35 per cent. Without 
the use of detailed mock catalogues to calibrate the group- 
finder, the ability to derive science from the 2PIGG cata^ 
logue would be compromised by the uncertainty in how the 



recovered groups related to the underlying distribution of 
matter. This is thus a crucial step in the group-finding pro- 
cedure. 

When applied to the two contiguous patches in the real 
2dFGRS, containing ~ 190 000 galaxies, this percolation al- 
gorithm groups 55 per cent of the galaxies into ~ 29 000 
groups containing at least two members. Focusing on those 
groups with at least 4 members, their median redshift and 
velocity dispersion are 0.11 and 260 kms~^ repectively. More 
detailed distributions of fundamental group properties are 
characterised in Section 5. 

This 2PIGG catalogue is the largest currently avail- 
able set of groups. It should provide a useful starting 
point for a number of studies concerning large scale struc- 
ture, galaxy group properties and the environmental de- 
pendence of galaxy properties. Upcoming papers will de- 
scribe in more detail the contents of the groups, for in- 
stance the galaxy luminosity functions within groups, the 
mass-to-light ratios of groups, the manner in which galax- 
ies are apportioned to different groups and the spatial 
distribution of galaxies within groups. The spatial abun- 
dance of groups both as a function of total group lumi- 
nosity and mass will also be investigated. The catalogue, 
including basic group properties, is available on the WWW, 
at http://www.mso.anu.edu.au/2dFGRS/Public/2PlGG/. 
A description of the contents of this web page is given in 
the appendix. 
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APPENDIX A: DETAILS OF THE CONTENTS 
OF THE WEB PAGE 

http://www.mso.anu.edu.au/2dFGRS/Public/2PIGG/ 
contains: 

1) the lists (NGP and SGP) of galaxies and the index of the 
groups in which they are placed, 

2) lists of group properties for the 2PIGGs, 

3) the equivalent lists for some mock catalogues. 

To illustrate the type of data that are available. Table Al 
shows the information provided for a subset of the galaxies 
and Tabic A2 contains a list of all 2PIGGs containing at 
least 100 members. 
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Table Al. Data for a small subset of the galaxies in the NGP. The quantities listed for each galaxy are: (1) right ascension in radians 
(1950 coordinates); (2) declination in radians (1950 coordinates); (3) redshift; (4) foj magnitude; (5) limiting 6j magnitude at this point 
in the survey; (6) n{z, 9) at this point in the survey (see equation 2.7); (7) galaxy weight, w, as described in Section 2.4; (8) the group 
number to which each galaxy is assigned (zero is ungrouped). 
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Table A2. Data for all 2PIGGs containing at least 100 galaxies. The group quantities listed are: (1) number of member galaxies; (2) 
right ascension of the group centre in radians (1950 coordinates); (3) declination of the group centre in radians (1950 coordinates); (4) 
group redshift; (5) rms projected galaxy separation m.h~^ Mpc; (6) group velocity dispersion inkms"^. 
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